Speeding problem detection in business surveys: benefits of statistical outlier detection methods
Abstract
Speeding refers to unusually fast responses to survey questions. Speeders characteristically answer without going through the cognitive process that a considered response requires. This low respondent engagement reduces the quality and validity of the data. The question is how to detect speeders in a survey, and the working assumption is that statistical outlier detection methods can be used for this purpose. This paper presents graphical methods for outlier detection, such as dot plots, scatter diagrams, histograms and box plots. The quantitative methods considered are the z-score, the modified z-score, Dixon's test, Grubbs' test, the Tietjen-Moore test and Rosner's test, also known as the generalized extreme studentized deviate (ESD) test. The performance of these methods was observed on the completion times of 217 surveys from enterprises that participated in a web survey on the use of statistical methods and that use such methods in their business processes. The analysis showed that none of the observed outlier detection methods detected speeders in an appropriate and satisfactory way, as shown by the threshold. The main reasons are slow respondents (slowers), violations of the normality assumption, and masking. Hence, existing outlier detection methods should be improved and adjusted in future research in order to detect speeders. Introducing novel speeder detection methods would also be a worthwhile direction for future research.
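As a rough illustration of two of the quantitative methods named in the abstract, the sketch below computes the classical z-score and the modified z-score (based on the median and the median absolute deviation) for a small set of completion times. The data, the function names and the cutoffs (3 for the z-score, 3.5 for the modified z-score, both common conventions) are assumptions chosen for illustration; they are not taken from the paper and do not reproduce its results.

```python
from statistics import mean, median, stdev

# Hypothetical completion times in seconds (not the paper's data):
# one very fast respondent (45) and one very slow respondent (2400).
times = [45, 310, 335, 350, 362, 380, 395, 410, 430, 2400]


def z_scores(values):
    """Classical z-score: distance from the mean in sample standard deviations."""
    m = mean(values)
    s = stdev(values)
    return [(v - m) / s for v in values]


def modified_z_scores(values):
    """Modified z-score: distance from the median, scaled by the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    # 0.6745 makes the MAD comparable to the standard deviation
    # when the data are normally distributed.
    return [0.6745 * (v - med) / mad for v in values]


def flag_speeders(scores, cutoff):
    """Return indices of unusually LOW values only (candidate speeders)."""
    return [i for i, z in enumerate(scores) if z < -cutoff]


if __name__ == "__main__":
    print("z-score flags:", flag_speeders(z_scores(times), 3.0))
    print("modified z-score flags:", flag_speeders(modified_z_scores(times), 3.5))
```

In this toy data the single slow respondent inflates both the mean and the standard deviation, so the classical z-score does not flag the fastest respondent, while the median-based score does. This is only meant to show how slowers can hide speeders; with real, skewed completion-time distributions, the robust score can also behave unsatisfactorily.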
Similar resources
Multivariate Outlier Detection and Treatment in Business Surveys
Multivariate outlier detection based on the Mahalanobis distance with the BACON-EEM algorithm, the TRC algorithm and the ER algorithm is presented and imputation of outliers and further missing values is discussed. The methods are illustrated with a data set on Swedish municipalities. The relation between outliers, influential observations and selective editing is explored. Finally robust multi...
Dynamic Outlier Detection in Price Index Surveys
The majority of data sets contain observations that do not conform to the structure followed by the rest of the data. These observations, known as outliers, can be found using a multitude of statistical and non-statistical methods. This paper highlights a generalized system built, specifically for price index surveys, which analysts can use to test different outlier detection methods. It also d...
Outlier Detection in Wireless Sensor Networks Using Distributed Principal Component Analysis
Detecting anomalies is an important challenge for intrusion detection and fault diagnosis in wireless sensor networks (WSNs). To address the problem of outlier detection in wireless sensor networks, in this paper we present a PCA-based centralized approach and a DPCA-based distributed energy-efficient approach for detecting outliers in sensed data in a WSN. The outliers in sensed data can be ca...
Detection of multivariate outliers in business survey data with incomplete information
Many different methods for statistical data editing can be found in the literature, but only a few of them are based on robust estimates (such as the BACON-EEM, Epidemic algorithm (EA) and Transformed rank correlation (TRC) methods of Béguin and Hulliger). However, we can show that outlier detection is only reasonable if robust methods are applied, because the classical estimates are them...
Support Vector Clustering for Outlier Detection
In this paper a novel Support vector clustering (SVC) method for outlier detection is proposed. Outlier detection algorithms have applications in several tasks such as data mining, data preprocessing, data filter-cleaner, time series analysis and so on. Traditionally, outlier detection methods are mostly based on modeling data according to its statistical properties, and these approaches are only prefe...
Publication date: 2017